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Abstract. We investigate the equilibration of an isolated macroscopic quantum 
system in the sense that deviations from a steady state become unmeasurably small 
for the overwhelming majority of times within any sufficiently large time interval. The 
main requirements are that the initial state, possibly far from equilibrium, exhibits a 
macroscopic population of at most one energy level and that degeneracies of energy 
eigenvalues and of energy gaps (differences of energy eigenvalues) are not of exceedingly 
large multiplicities. Our approach closely follows and extends recent works by Short 
and FarrcUy [2012 New J. Phys. 14 013063], in particular going beyond the realm of 
finite-dimensional systems and large effective dimensions. 
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1. Introduction 

It is a basic everyday experience that isolated macroscopic systems, i.e. systems 
consisting of many particles or other microscopic degrees of freedom, approach some 
steady equilibrium state after a sufficiently long time evolution, no matter how far from 
equilibrium they started out. More precisely, for every single run of the experiment, one 
still may encounter certain statistical or quantum mechanical fluctuations, especially for 
microscopic observables, but on the average over many repetitions of the experiment, 
all expectation values appear to equilibrate. 

The reconciliation of this irreversible behavior with quantum mechanical revers- 
ibility and revival/recurrence properties is a long standing problem [1] which has 
recently be reconsidered from a new viewpoint, without focusing on any speciflc model 
class and without any modiflcation/approximation of the exact quantum mechanical 
time evolution [2-7]. The key point of these works is to show that the expectation 
values may still exhibit everlasting small fluctuations around their equilibrium values, 
as well as very rare large excursions away from equilibrium (including the above- 
mentioned recurrences), but quantitatively these fluctuations are either unobservably 
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small compared to any reasonably achievable resolution limit, or exceedingly rare on 
any realistic time scale after initial transients have died out. In this sense, the system 
indeed equilibrates. 

Originally, these conclusions have been based on the following main assumptions 
[2-5]: 

(i) The considered observables represent experimental measurement devices with a 
finite instrumental range and a possibly very small but still 'reasonable' resolution 
limit. 

(ii) The initial condition exhibits a small occupation probability of every single energy 
level, which is very plausible in view of the unimaginably large level density of 
typical macroscopic systems. 

(iii) All energy eigenvalues are non-degenerate. 

(iv) All energy gaps are non-degenerate, i.e. a pair of distinct energy eigenvalues never 
exhibits the same energy difference as some other pair. 

(v) Either the considered Hilbert space must be finite-dimensional [3, 4], or some formal 
manipulations are not rigorously justified [2, 5], in particular interchanging the limit 
of infinite dimensions with the long-time limit. 

The restrictions (iii) and (iv) have recently been overcome in two very important 
contributions by Short [6] and by Short and Farrelly [7]. In our present work, we 
closely follow and further extend their approach by relaxing also the above conditions 
(ii) and (v). Namely, we only require that the second largest level population must be 
small, while the occupation probability of one level may be macroscopic (non-small). 
Such a case may e.g. be of relevance for thermal equilibrium states at extremely low 
temperatures. Furthermore, we will admit and treat with care (countably) infinite 
dimensional systems. 

Our present approach also bears resemblance to recent advancements made by 
Goldstein and coworkers of ideas originally due to von Neumann [8-11]. The main 
difference is that in these works an alternative notion is adopted of when a system is in 
or close to equilibrium, and the main emphasis is laid on macroscopic (coarse-grained) 
observables, exhibiting the same expectation value for most states within any quantum 
mechanical energy shell. 

Furthermore, our approach is complementary to numerous recent investigations of 
equilibration for various specific systems, observables, and initial conditions, and often 
with a main focus on the role of (non-)integrability, see e.g. [12-28] and references 
therein. 

Another issue closely related to equilibration is the problem of thermalization, i.e., 
the question whether, and to what extend, the above mentioned equilibrium states agree 
with any one of Gibbs's statistical ensembles. This important issue, either for an isolated 
system per se or for an isolated system-plus-bath composite, has been recently addressed 
in e.g., [2-5, 8-24, 29, 30], but will not be considered here in any further detail. In other 
words, our notion of equilibrium is weaker than that of thermal equilibrium. 
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2. General Framework 

2.1. System and Hamiltonian 

We consider an isolated system, confined to a finite region of space and involving a finite 
number of particles. Later we will mainly be interested in macroscopic systems, but for 
the moment any finite number of degrees of freedom is admitted. In particular, we may 
be dealing with a compound system, consisting of a subsystem of actual interest and its 
environment (reservoir or thermal bath). 

According to standard quantum mechanics, such a system is modeled by a time- 
independent Hamiltonian H on a. separable (i.e. at most countably infinite-dimensional) 
Hilbert space "H. Since we consider the system to be confined to a finite region of space, 
all eigenvectors of H represent bound states and the spectrum of H is discrete (pure 
point). As a consequence, the Hamiltonian can be written in the form 

H = J2EnPn, (1) 

n 

where the P„ are projectors onto the eigenspaces of H with eigenvalues En, satisfying 

PmPn ^mnPni (2) 
n 

En ^ En,, iin^m. (4) 

Here, n,m G {1, . . . , rf^;}, where the number (Ie of distinct energy eigenvalues may be 
finite or infinite. The symbol Yin indicates a summation over all those n-values, 6mn is 
the Kronecker symbol, and 1 the identity on Ti. In particular, any energy eigenvalue 
En is allowed to be degenerate and its multiplicity is given by 

fin ■■= Tr{P„}, (5) 

where Tr denotes the trace on Ti. The dimension of "H thus amounts to J2n l^n and may 
be finite or infinite. 



2.2. States and dynamics 

The system's state at time t is captured as usual by a density operator p(t), describing 
either a statistical ensemble (mixed state) or a pure state, and evolving in time according 
to pit) = Utp{fi)Ul with time-evolution operator Ut := exp{— lift} and h= 1. With (1) 
we can conclude that 

p{t) = ^ Pmn(O) exp[-i(£'^ - En)t], (6) 

mn 

where we have introduced the auxiliary operators 

Pmnit) := PmP{t)Pn. (7) 

While Pnn{t) are thus time-independent and self-adjoint operators, the same generically 
does not hold for Pmn(t) when m ^ n. In particular, Pnnif) equals Pnn(O) and the 
time-argument will often be omitted. 
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2.3. Realistic observables 

Observables are represented as usual by self-adjoint operators A with expectation 
values TT{p{t)A}. In order to model real experimental measurements it is however not 
necessary to admit any arbitrary self-adjoint operator [31-38]. Rather, it is sufficient 
to focus on realistic observables in the following sense [2, 5, 39]: Any observable A 
must represent an experimental device with a finite range of possible outcomes of a 
measurement, 

Aa := sup {ip\A\i)) - inf (tp] A\tjj) = a^np - ai^i < oo, (8) 

where 

S{H) := {i'eH\ = i}cn (9) 

denotes the set of normalized vectors in "H. Moreover, this working range of the 
device must be limited to experimentally reasonable values compared to its resolution 
limit 6A. All measurements known to the present authors yield less than 20 relevant 
digits, i.e. 

Aa/SA ^ 10'°. (10) 

Maybe some day 100 or 1000 relevant digits will become feasible, but it seems reasonable 
that a theory which does not go very much beyond that will do. Note that similar 
restrictions also apply to 'numerical experiments' by computer simulations. 

According to (8), all eigenvalues of A must be contained within the finite interval 
[cinf , flsup] and the operator norm 

Pll := sup (11) 

i'€Sin) 

is finite and equal to max{|ainf|, |asup|}- (As usual, the vector norm on the right hand 
side of (11) is the one induced by the scalar product on "H). 



2.4. Level populations 

The specific observable A = Pn describes the population of the energy level En with 
expectation value (occupation probability) 

Pn := Tr{P„p(t)} = TT{pnn}. (12) 

The last relation shows the time- independence oi Pn and follows from (2), (7), and the 
invariance of the trace under cyclic permutations. 

For a system with / degrees of freedom there are roughly lO^*^-'^-* energy eigenstates 
with eigenvalues in every interval of IJ beyond the ground state energy ([40]; for a 
more detailed discussion, see also section 2.1 of [5]). The same estimate carries over 
to the number of energy eigenvalues under the assumption that their multiplicities 
(5) are much smaller than lO*-'^'^-'. For a macroscopic system with / = 0(10^^), the 
energy levels are thus unimaginably dense on any decent energy scale and even the most 
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careful experimentalist will not be able to populate only a few of them with significant 
probabilities p„. 

We recall that pit) may be a pure state, but the case of foremost interest is on 
mixed states describing a statistical ensemble. Then, p„ describes an ensemble average 
over many repetitions of the experiment. Hence, an 'accidentally large' population of 
a few levels in one particular experimental run is still admissible, but very unlikely to 
occur again when the experiment is repeated. 

To obtain a rough estimate, we imagine that there are exactly 10(^°'') energy 
levels per J. Even if the experimentalist can prepare the energy of the system with 
a fantastically small uncertainty of lO"'-^"^^'' J, there still remain := 10°-^^^''^^ energy 
levels which may be occupied with significant probabilities. If all of them are populated 
equally, we obtain p„ = for of the indices n, and Pn = for all other n. If not all 
levels are populated equally, but rather any p„ may assume arbitrary values between 
zero and lO*^^"^^-' times the average population we still obtain p„ ^ -j^g-o.sxio^s^ 

Returning to the general case, we can conclude [2, 5, 39] that even if the system's 
energy is fixed up to an extremely small experimental uncertainty, and even if the 
energy levels are populated extremely unequally, we still expect that even the largest 
ensemble-averaged level population p„ will be extremely small and typically satisfy the 
rough estimate 

maxp„ = 10-°(^\ (13) 



n 



2.5. Macroscopic population of one energy level 

There is one physically significant situation in which the above arguments may become 
questionable. Namely, for an isolated macroscopic system which approaches a thermal 
equilibrium state with an extremely low temperature, it might be conceivable that the 
ground state energy exhibits a macroscopic population, i.e. the corresponding p„ is no 
longer extremely small. Hence, we should omit that specific p„ in the maximization 
(13), formally written as 

max'pn = 10~°(-^). (14) 

n 

In other words, the prime indicates that the largest p„ is not included into the 
maximization and hence max„'p„ represents the second largest level population. 

Further situations resulting in a non-small population of one single level may be 
caused e.g. by certain 'gaps' in the energy spectrum or by one level with an extremely 
high multiplicity (5). 

Note that the expected relations Pn ^ and J2nPn = 1 readily follow from (3), 
(12), and the fact that p(t) is non-negative and of unit trace. We thus can conclude that 
the maxima max„p„ and max^p„ indeed exists — as anticipated in (13) and (14) — and 
that ^ Yl'n'Pn < 1) where the prime in Yl'n excludes the index n belonging to the 
maximal p„ . It follows that 

EpI ^ max'pnV' Pn ^ max'p„, (15) 
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/ / / 2\l/2 ^ /V^' 2\ 



(16) 




and we can conclude that 



max'pn small small. 



(17) 




In references [3, 4, 6, 7], the quantity d^s := 1/ ^nPni called the effective dimension 
of the state p(t), is introduced. It quantifies the number of distinct energies that 
contribute notably to this state, and is required to be a large number. Observing 
that the equivalence (17) also applies without primes, we see that the requirement of a 
large effective dimension is fulfilled if and only if the maximal level population max„ 
is small. However, in the more general case including the primes in (17), as considered 
in our present work, the effective dimension dgfr may not be large any more. 

2.6. Equilibration and equilibrium ensemble 

Generically, the statistical ensemble p{t) is not stationary right from the beginning, 
in particular for an initial condition p(0) out of equilibrium. But if the right hand 
side of (6) depends on t initially, it cannot approach for large t any time-independent 
'equilibrium ensemble' whatsoever. In fact, any mixed state p{t) returns arbitrarily close 
(with respect to some suitable distance measure in Hilbert space) to its initial state p(0) 
for certain, sufficiently large times t, as demonstrated for instance in appendix D of [41]. 

We will therefore focus on the weaker notion of equilibration outlined in section 1, 
requiring the existence of a time-independent 'equilibrium state' uj (density operator) 
with the property that the difference 



between the true expectation value Tr{p(t)/1} and the equilibrium reference value 
Tr{cj/1} is unresolvably small for the overwhelming majority of times t contained in any 
sufficiently large (but finite) time interval [0,T]. (Note that initial transients become 
irrelevant if T is chosen large enough.) 

Heuristically, if any such equilibrium ensemble exists, then it should be given by 
the infinite time average of p{t). In view of (6) this suggests the definition 



However, from a more rigorous viewpoint, it is not so obvious that averaging (6) over 
arbitrary but finite times leads to a well-defined long-time limit, which is furthermore 
given by (19). Specifically, for infinite-dimensional systems, interchanging the infinite 
time limit with the infinite double-sum in (6) is problematic. We avoid all these 
difficulties by defining u according to (19) without any reference to averages over 
time. An alternative and entirely unproblematic viewpoint is to consider uj as the 
time- independent part of p{t). 



a{t) := Tr{p(t)A} - Tt{ujA} 





(19) 



n 
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One readily sees that u inherits from p{t) the properties of being self-adjoint, 
non-negative, and of unit trace. Furthermore, u satisfies the trivial time evolution 
UtOoU} = u. In other words, u is indeed a perfectly well-defined density operator. 

We finally note that, as far as the differences in (18) are concerned, nothing changes 
if A is replaced hj A + cl with an arbitrary real c. Thus, we henceforth can assume 
without loss of generality that Osup = — Oinf in (8), implying with (11) that 

\\A\\ = Aa/2. (20) 
3. Prom infinite to finite dimensions 

We focus on infinite-dimensional Hilbert spaces H and denote the normalized 
eigenvectors of the Hamiltonian H by |z/) with u = 1,2,.... For any given positive 
integer d we define the projectors 

d 

P:=J2W)H (21) 
i/=i 

Q:=l-P. (22) 
For an arbitrary density operator p and any observable A it follows that 

Ti{pA} = Tr{(P + Q)p{P + Q)A} = Ri + R2 + R3 (23) 

with 

Ri := Tv{PpPA}, (24) 

R2 := Tt{QpA}, (25) 

i?3 := Tt{PpQA}. (26) 

Making use oi = P and the cyclic invariance of the trace, one can rewrite Ri as 
Tr{(PpP)(PAP)} or 

Pi = Tr{pi}, (27) 
p ■.= PpP, (28) 
A := PAP. (29) 

To further evaluate R2 we represent the self-adjoint operator p in terms of its 

eigenvalues p^, and eigenvectors \(t)u), 
00 

P = $^P.|0v)(0.|- (30) 

Since p is non-negative, all p^ are non-negative and 

00 

//':=Ev^l<^-)<'^'^l (31) 
i/=i 

is a well-defined, self-adjoint operator with the property that p^/^p^/^ = p. It follows 
that 

IP2P = |Tr{(Q//2)(//M)}|^ ^ Tr{QpQ}TT{ApA}, (32) 
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where we exploited the Cauchy-Schwarz inequahty 

\Ty{B^C}\' ^ Tr{5^5} Tr{C"fC} (33) 

for the scalar product Tr{i?^C} of arbitrary operators B and C (for which all traces in 
(33) exist). The last term in (32) equals Tr{pA^}, and by evaluating the trace by means 
of the orthonormal basis |0„,) one can infer with (11) and (30) that 

Tr{pA2} ^ Tr{p} ^ pf , (34) 

where we exploited that Tr{p} = 1 and \\A^\\ = ||v4|p in the last relation. Finally, we 
conclude from (21) and (22) that 

oo 

Tt{QpQ} = J2 (^\P\^) (35) 

u=d+l 

and hence 

oo 

\R2\' ^ WAf (Hpk)- (36) 

i/=d+l 

Next, starting from (26), we rewrite -R3 as Tt{APpQ}. Noting that all four 
operators under this trace are self-adjoint and that TrlfiT} = Tt{B}* for arbitrary 
operators B, we can conclude that I-R3I = |Tr{QpPyl}|. Proceeding similarly as in 
(32)-(34) and using Tr{p} ^ 1 one finds that 

00 

< Tr{Qpg}||A|pTr{p} ^ ||Af {u\p\u). (37) 
This in turn implies via (23)-(29) that 

°° 1/2 

I Tr{pA} - Tr{pi}| ^ 2\\A\\ ( (HpI^)) (38) 

i/=d+l 

for arbitrary density operators p. Applying this relation to the specific density operators 
p(t) in (6) and u in (19) we obtain 



with 



I Tr{p(t)A} - Ti{ujA}\ = I Tr{p(t)i} - Tr{£l;i}| + R (39) 

R ^ I Ti-{p{t)A} - Tt{ujA} - Tr{p(t)i} + Tr{£l;i}| (40) 

°° 1/2 

^4P||( 5^ (Hp(O)Iz.)) . (41) 

u=d+l 

In (40) we exploited the fact that |x| — \y\ ^ — 1/| for arbitrary real x and y, while in 
(41) we applied (38) to p{t) and u and we took into account that, according to (6) and 
(19), both {h'\p{t)\iy) and (i/|u;|z/) are equal to (z/|p(0)|z/). 

Observing that Ylt=ii^\p(^)\^) increases with d and approaches Tr{p(0)} = 1 for 
c? — )■ 00 it follows that, for any given e > 0, there exists a finite d{e) with the property 
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that R ^ ||^||e- According to (10), upon choosing e = 10 and observing (20), we can 
conclude that 

R ^ (£4/2 (42) 

with one common, finite d for all experimentally realistic observables A. 

As far as experimentally resolvable differences (18) are concerned, it thus fol- 
lows from (39) and (42) that it is sufficient to consider, instead of p{t) and A, 
their counterparts p{t) and A. According to (20), (28) and (29) these are the 
projections/restrictions of the original operators p{t) and A onto the finite dimensional 
sub-Hilbert space Ti G Ti, spanned by the first d energy eigenvectors {|i^)}^=i- Note 
that while Ti is independent of A, it does depend on p(0) and H in general. 

It remains to be shown that the entire framework set out in section 2 can be 
consistently restricted to the finite-dimensional Hilbert space H: Observing that the 
projectors P„ from (1) commute with the projector P defined in (21) implies that 

P„ := PP„P = PP„ = P„P (43) 

Setting H := PHP and keeping only indices n with non-zero P„, relations (l)-(5) 
remain valid, but now with finite sums and finite multiplicities fin- Furthermore, 
one sees that H indeed reproduces the correct time evolution of p{t) with finite sums in 
(6). While p(t) is still non-negative and self-adjoint, the trace now satisfies 

Tr{p(^)} = E^"^l- (44) 

n 

Likewise, with respect to the operator norm (11) and the level population (12) one finds 
that 

^ Pll, Pn^Pn- (45) 

4. Finite-dimensional systems 

The main objective of this section is to establish bounds on the difference 

a{t) := Tr{p(t)i} - Tr{cui}, (46) 

where tildes indicate the projections/restrictions to the finite-dimensional Hilbert space 
T-L from the previous subsection in case the original Hilbert space "H was infinite- 
dimensional (otherwise the tildes are redundant). We recall that d < oo denotes the 
dimension of "H (see below (42)) and ds ^ d the number of distinct energy eigenvalues 
En of H (see below (4)). 

Adopting the approach of Short and Farrelly [7], we start by considering the 
quantity {a'^{t))T, where {■)t denotes a temporal average over the time interval [0,T] 
with arbitrary but finite T > 0. From (6) and (19) we can infer that 



= ( E Tr{p™„i} exp [-i(E„ - En)t] ' ) , (47) 
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where Pmn(O) is abbreviated as pmn and the sum runs over the finite set of pairs of labels 
Q := {(m, n)\m,n E [1, • • • , o^e], m 7^ n}. (48) 
For any a = (m, n) G ^ we define 

:= - En, Va := Tr{p^„i}. (49) 
We thus can rewrite (47) as 

{d\t))^ = / |^t;,exp [-iGj] ) = J]<M„^w^, (50) 

where we introduced the self-adjoint, non-negative, finite-dimensional matrix M with 
matrix elements 

M^f, := (exp [i(G„ - Gf,)t\)^. (51) 

Denoting by ||Af|| the standard operator norm of the matrix M (see (11)), it follows 
that [7] 

(a2(t))y ^ 5||Af|| (52) 

S ■.= Y,W\^ =Y}'^Ap^n~A}\\ (53) 

Bounds on the two factors S and ||M|| in (52) are constructed in the following two 
subsections. 

4.1. Bound on S 

We exploit (2), (7), and the cyclic invariance of the trace to conclude that 

IliPmnA} = 1l{PrnpPnAPm]. (54) 

Similarly to the derivation in (30)-(32), we write 

Pn^pPnAPrn = {PrnP^'^){p^'^ PnAP^,) (55) 

from which it follows that 

I Tr{p™„i}|2 ^ Tr{p„„iP„,i}. (56) 
We first evaluate by means of (56) all summands in (53) with n = 1, 

Sn=l^ Pm TljpllAPmA} 

^ maxpnTT^puA^PmA^ 
^ maxp„Tr{pii} U V'P^i 



m^2 

^ maxp.„|| (57) 

In the second line we used that pm ^ Pm ^ niax„j.2Pn for all m ^ 2, see (45). The 
third line is based on a similar line of reasoning as in (34), exploiting that pu is a 
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non-negative, self-adjoint operator. In the last line we used that Tr{pii} = ^ 1 (see 
(12) and (44)), that ||5C|| ^ ||i?||||C|| for arbitrary operators B, C of finite norm, and 
that ^mj>2 Pm is a projector and hence of unit norm. 

For symmetry reasons, the same estimate as in (57) applies for the summands with 
m = 1 in (53). The remaining summands in (53) satisfy m ^ n and m, n ^ 2. By 
including also those with m = n, the sum can only increase, resulting in 

S ^ 2PII Wpn + V'|Tr{Pm„i}|', (58) 

n ' 

m,n 

where the prime indicates that indices 1 are excluded from the maximization and the 
summation. Clearly, instead of this special index 1 we could have selected any other 
index as well. Thus, in agreement with section 2.5, the prime can and will be understood 
as excluding the index belonging to the maximally populated level. 

The remaining sum in (58) can be estimated by means of the two non-negative, 
self-adjoint operators 

pnn, := 2^ PmPm (59) 



Co' 



in the following way: 



J2 |Tr{p^,i}|' ^ TT{Cd'Au"A} ^ ^JTT{{Co'yA^} Ti{{Q")^A^} 



< VTr{(c:;0^}PPTr{(^")^}PP = T^r{pUj^ f^^^pl. (60) 




In the first inequality, we exploited (56), the next two ones follow by arguments similar 
to the ones in (32) and (34). The last equation is based on (2), (5), (7) and (59). 

Once again, the line of reasoning in (60) follows very closely that of Short and 
Farrelly [6, 7]. The main difference is that these authors focus, in a first step, solely 
on pure states and only in the end extend their result to mixed states via purification. 
Along this line, one actually arrives at a final result which is slightly different from (60), 
namely 

Y^m~PrnnA}\'^\\ArY!pl (61) 

m,n n 

Closer inspection shows that (60) and (61) agree if and only if all energies are non- 
degenerate. In any other case, one can show that the bound (61) is sharper than (60). 
On the other hand, one readily sees that Tr{p^„} ^ p,^ and hence 



'5^ Tr{p2^}^ p^p^ ^ max'v^^ (62) 

n m n 

The equality sign applies whenever p{t) is a pure state contained in the maximally 
degenerate energy eigenspace, in any other case the inequality sign applies. In 
conclusion, (61) outperforms (60) by at most a factor of max^-y/p^, i.e. the square 
root of the maximal energy degeneracy. 
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The purification argument [6, 7] behind (61) is mathematically very appealing 
(somewhat reminiscent of evaluating real integrals by 'complexification') but its physical 
content remains slightly mysterious. It is reassuring that one can get at least as far as 
(60) without this argument, but it is annoying that (61) could not be fully recovered. 

Working with the stronger bound (61) by Short and Farrelly, the estimate (58) for 
S in combination with (15) and (45) takes on the simple form 

S ^ 3pf max'p„. (63) 

n 

While the maximization so far includes only a finite number of indices n, this restriction 
can be readily released, as the maximum can only increase in this way. As in section 2.5, 
the last factor then represents the second largest level population of the original, possibly 
infinite-dimensional system. 

4-2. Bound on ||M|| in terms of energy gaps 

The main idea is that, since M is a finite-dimensional matrix, its operator norm ||M|| 
converges towards a well-defined limit as the averaging time T (see (47), (51)) tends to 
infinity. Hence, ||M|| can be readily bounded from above for all sufficiently large (but 
finite) T. 

Quantitatively, by setting out from the inequality 

||M|| ^max^lM^^I, (64) 

a 

Short and Farrelly [7] derived the estimate 

P/|| < N(.) (l + ^i^") (65) 

for arbitrary e > and T > 0. Here, (Ie is the number of distinct energy eigenvalues En 
(see below (46)) and 

N{e) := max|{a eg\Gae[E,E + e)}\. (66) 

E 

According to (48) and (49), A^(e) is thus the maximum number of energy gaps 
Ga = Em — En in any interval of size e. 

The main implication of this result can also be deduced directly from (64): since 
we are dealing with finite-dimensional systems, the finite number of all matrix elements 
|May3| in (51) with Ga 7^ Gjs can be simultaneously bounded by an arbitrarily small 
upper limit for sufficiently large T. Hence, their contribution to (64) can be made 
smaller than the contribution of all the remaining summands, satisfying G^ = Gp and 
thus Map = 1. It follows for all sufficiently large T that 

\\M\\<:2g, (67) 

where 

g := max |{a G ^ | Ga = Gjs}] (68) 
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denotes the maximal degeneracy of energy gaps. Note that only the energy eigenvalues 
En of the restricted, finite-dimensional Hamiltonian H contribute to Q in (48) and hence 
to the degenerate energy gaps counted in (68). 

4-3. Problems in the infinite- dimensional limit 

So far, we have assumed a finite dimensionality d of the Hilbert space %. In this 
subsection we argue that it is intuitively suggestive that everything 'should go well' 
upon letting d go to infinity, but that a more rigorous justification is problematic. From 
the latter viewpoint, the considerations in section 3 are thus indispensable for infinite- 
dimensional systems. 

First of all, since A has a finite range A^, see (8), it follows that a{t) from (18) 
is contained in the finite interval [— 2A^, 2A^] for all times t. This suggest (but does 
not prove) that the temporal average (^^(t))^ converges in the limit T — )■ oo even for 
infinite dimensional systems: although one can readily construct mathematical examples 
of bounded functions without a well-defined infinite-time average, it appears plausible 
that 'reasonable' physical models will result in functions o"^(t) which do not exhibit the 
pathologies of those examples. 

On the other hand, for any given finite dimension d, equation (50) has a well defined 
T — oo limit. Focusing on the simplest case of non-degenerate energy gaps. Map in 
(51) approaches 5^^ and hence the inequality (52) turns into an equality with ||Af || = 1. 
Since 5* in (53) is positive and bounded by the ^-independent estimate (63), it is once 
again suggestive that S itself converges for d — oo. Under the further assumption that 
the two limits T — oo and d ^ oo commute, one then readily finds an upper bound 
analogous to (69) for infinite dimensions and all sufficiently large, but finite, T. 

Although these heuristic arguments appear plausible at first glance, some subtle 
open questions remain upon closer inspection: For infinite-dimensional systems one 
typically expects the existence of arbitrarily small, but non-vanishing energy gap 
differences Ga — Gp. While each single matrix element (51) then still converges for 
T — )■ oo, the same is no longer clear for the entire, infinite-dimensional matrix M 
and/or its norm ||M||. For the same reason, A^(e) from (66) is expected to diverge for 
d ^ oo and any fixed e > 0, so that (65) becomes useless. Likewise, our derivation of 
(67) breaks down. In other words, interchanging the limits T — )■ oo and d — ?■ oo is a 
rather delicate, unsettled issue. 

5. Main result and discussion 

Combining (20), (52), (63) and (67), we obtain the inequality 

{a\t))^ ^ IgA^max'pn (69) 

n 

for all sufficiently large T. 
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For any given e > and T > we define the measure of all times t G [0,T] for 
which \d'{t)\ ^ e holds true, 

T, := \{{t\te [0,T] and \a{t)\ ^ e}), (70) 

where A denotes the Lebesgue measure. It follows that 5"^(t) ^ for a set of times t of 
measure T^ and cr^(t) ^ for all remaining times t in [0, T]. Hence the temporal average 
of cr^(t) over the time interval [0,T] must be at least e^T^/T, 

^ (71) 
Choosing e = &4/2, we can conclude from (69) and (71) that 

%H,6,(^)\WP. (72) 

for all sufficiently large T. 

Next we infer from (39), (42) and (46) that 

|Tr{p(t)A}-Tr{c.A}|^|^(t)| + ^. (73) 

Analogously to (70), we define the measure of all times t G [0, T] with the property that 
1 1i{p{t)A} — Tr{u;A}| exceeds the experimental resolution limit 5A, 

TsA := X{{t 1 1 G [0,T] and | Ti{p{t)A} - Tt{ujA}\ ^ 6A}). (74) 

In view of (70) with e = 6A/2 and (73), we conclude that Tsa ^ Tsa/2- With (72) we 
thus arrive at the main result of our present work, 

^ ^ (^) ^f"'^- ^^^^ 
for all sufficiently large T. 

The left hand side of (75) represents the fraction of all times t G [0,T] for which 
there is an experimentally resolvable difference between the true expectation value 
TT{p{t)A} and the time-independent 'equilibrium expectation value' TrjcuA}. 

On the right hand side, g is the maximal degeneracy of energy gaps from (68), i.e. 
the maximal number of (exactly) coinciding energy differences among all possible pairs 
of distinct energy eigenvalues of the reduced, finite-dimensional Hamiltonian H from 
section 3, and as such is determined by properties of both the Hamiltonian H and the 
initial condition p(0). Alternatively, one may also take into account all energies of the 
full system Hamiltonian H, since the maximum can only increase in this way, but this 
increase might possibly become prohibitively huge for infinite dimensional systems (see 
also section 6). 

The next factor Aa/SA appearing in (75) is the range-to-resolution ratio of A from 
section 2.3, i.e. a characteristic property of the observable A only, and can be considered 
as bounded according to (10) for all experimentally realistic measurements A. Going 
back to section 4.1, one readily sees that one could as well replace by the range of 
the reduced observable A. This range typically is somewhat smaller than the original 
range of A (see (45)), but this gain might often not be worth the effort. 
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Finally, max„'p„ in (75) stands for the second- largest, ensemble-averaged occupa- 
tion probability of the (possibly degenerate) energy eigenvalues see sections 2.4 
and 2.5. Similarly as for A, one alternatively could maximize over the reduced level 
populations p^, but often this will not be worthwhile. Essentially, max„'p„ is thus a 
characteristic property of the initial condition p(0), but obviously also the Hamiltonian 
H itself matters. Typically, one expects that the rough upper bound (14) applies, except 
if certain energy eigenvalues are so extremely highly degenerate that the multiplicities 
defined in (5) severely reduce the pertinent energy level density compared to the non- 
degenerate case, see section 2.4. 

For a system with sufficiently many degrees of freedom / and no exceedingly large 
degeneracy of the energy eigenvalues and the energy gaps, we thus can conclude from 
(75) with (10) and (14) that the system behaves in every possible experiment exactly 
as if it were in the equilibrium state uj for the overwhelming majority of times within 
any sufficiently large (but finite) time interval [0,T]. In particular, T must obviously be 
much larger than the relaxation time in case of a far-from-equilibrium initial condition 
p(0). A more detailed quantitative bound on T follows from the result (65) by Short 
and Farrelly [7]. 

We emphasize that even in the absence of any measurable difference between p{t) 
and w, the equilibrium state u itself is never realized in the actual system, and as such is 
a purely formal, theoretical construct. In particular, the difference between Ti{p{t)A\ 
and TrjcuA} is not a quantity one can measure in a real system, not even in principle. 

Further interesting physical implications of (75) are discussed, in e.g., [5]. 

6. Conclusions 

To summarize, by adopting and extending recent works by Short [6] and by Short and 
Farrelly [7], we demonstrated equilibration of isolated macroscopic quantum systems 
in the sense that deviations from a time-independent steady state are unmeasurably 
small for the overwhelming majority of times within any sufficiently large time interval. 
This conclusion applies for arbitrary systems with (countably) infinite dimensions, initial 
states exhibiting a macroscopic population of at most one energy level, and Hamiltonians 
without exceedingly large degeneracies of energy eigenvalues and energy gaps. 

As soon as a model includes at least one continuous degree of freedom (e.g. a 
spatial coordinate), the pertinent Hilbert space is necessarily of infinite dimension. If 
the system is furthermore of finite spatial extension (e.g. due to confining walls), the 
Hamiltonian H features a discrete spectrum and can be written in the form (1). Both 
conditions are clearly satisfied in almost any model which is not based on some extreme 
simplifications. Moreover, it is practically impossible to prepare a real system so that 
only a finite number of energy eigenstates is populated. Rather, generically an infinite 
number of them contributes with possibly small, yet finite, amplitudes. Reducing or 
truncating this situation in whatever way to finite dimensions is especially problematic 
with respect to rigorous statements about the exact quantum mechanical evolution 
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over arbitrarily long time intervals. More precisely, interchanging the limit of infinite 
dimensions with the long time limit, as discussed in section 4.3, is a subtle problem, 
justifying the detailed treatment of the infinite-dimensional case in section 3. 

Admitting systems with degenerate energy eigenvalues and energy gaps are among 
the most important steps forward achieved in [6, 7]. On the other hand, it is generally 
taken for granted that such degeneracies are absent in generic Hamiltonians, see e.g. 
[2, 3, 5, 12, 13, 42] and, in particular, section 3.2.1 of [43] and references therein. 
Roughly speaking, Hamiltonians with such degeneracies are of measure zero compared 
to 'all' Hamiltonians. They only arise in the presence of special reasons like (perfect) 
symmetries, additional conserved quantities, or fine-tuning of parameters, which can 
be ruled out for typical real systems provided they cannot be further decomposed into 
non- interacting subsystems [3]. It is reassuring that even some of those exceptions — 
namely those without exceedingly high degeneracies — are now covered in [6, 7] and in 
the present work. Accidental degeneracies due to fine-tuning of parameters should not 
lead to high degeneracies, but the quantitative effect of symmetries on level and gap 
degeneracies is not clear to the present authors. In order to better understand the 
occurrence of degeneracies, it appears indispensable to study specific examples. The 
harmonic oscillator would be one which is still not admissible (as the degeneracy of 
energy gaps is too large), but the hope that other relevant examples will be 'tame' 
enough seems reasonable. 
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